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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: BONNEFOY, JEAN- YVES 

GAUCHAT, JEAN-FRANCOIS 

(ii) TITLE OF INVENTION: SUBSTANCES AND THEIR USES 

(iii) NUMBER OF SEQUENCES: 9 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: NIXON & VANDERHYE P.C. 

(B) STREET: 1100 NORTH GLEBE ROAD 

(C) CITY: ARLINGTON 

(D) STATE: VIRGINIA 

(E) COUNTRY: U.S.A. 

(F) ZIP: 22201-4714 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: MS Word 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 29-SEP-2003 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 09/545,002 

(B) FILING DATE: 06-APR-2 000 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/969,125 

(B) FILING DATE: 12-NOV-1997 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: GB 9625899.1 

(B) FILING DATE: 13-DEC-1996 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Tyr Xaa Xaa Gin 
1 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5 amino acids 



(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Trp Ser Xaa Trp Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(v) FRAGMENT TYPE: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
CTGAGCTACA TGAAGTGTTC TTGGCTCCCT 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(v) FRAGMENT TYPE: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 
CAGAGTTTGT CATCCTCATA GCATAACTTA 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(v) FRAGMENT TYPE: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
AATACCACTC CCGACACTAA CTATACTCTC 
(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(v) FRAGMENT TYPE: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GGAGAAATGG CTTTCGTTTG CTTGGCTATC 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(v) FRAGMENT TYPE: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
TACCATGTCT CTTGATATGG AAAGTCTTCA 
(2) INFORMATION FOR SEQ ID NO: 8 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4038 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(v) FRAGMENT TYPE: linear 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 43.. 1323 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 
GCCAAGGCTC CAGCCCGGCC GGGCTCCGAG GCGAGAGGCT GC ATG GAG TGG CCG 

1 
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40 45 
AAT CCA CCC GAG GGA GCC AGC TCA AAT TGT AGT CTA TGG TAT TTT ACT 



Asn Pro Pro Glu Gly Ala Ser Ser Asn Cys Ser Leu Trp Tyr Phe Ser 
55 60 

S E £ S E - S E - S S S E E E S 

2 s e e s s e - is s s e a e s e 

85 9° 95 

S° E 52 £ US EE E E E E S 3 E E E 

105 110 

55 K E E £ E S £ 5 E E S E E - E 

120 125 
CAA TGC ATT TGG CAC AAC CTG AGO TAG ATG AAG TGT TCT TGG CTC CCT 
Cys lie Trp His Asn Leu Ser Tyr Met Lys Cys Ser Trp Leu Pro 
135 I 40 145 

S S E E E 5 E E E E E S E E S E 
E E S ^ E E E E E E E £ E S K S 
£ E E £ E E S E S E E E E E E E 

185 190 

E E E E E E E £ S E E S E E E E 
£ E S E E E S E E E E E E E E S 

215 220 225 

rAT CCT CCA CAT ATT AAA AAC CTC TCC TTC CAC AAT GAT GAC CTA TAT 
Sp Pro Pro Ss fie Lys Asn Leu Ser Phe His Asn Asp Asp Leu Tyr 
230 235 ^ 

E E S E E £ E E E S E E E E E E 
E E E E E E E E E E E E E E E E 

GTC CAA GAG GCT AAA TGT GAG AAT CCA GAA TTT GAG AGA AAT GTG GAG 
vS Glu Ala Lys Cys Glu Asn Pro Glu Phe Glu Arg Asn Val Glu 

280 285 
AAT ACA TCT TGT TTC ATG GTC CCT GGT GTT CTT CCT GAT ACT TTG AAC 
A^n ?£ Ser Cys Phe Met Val Pro Gly Val Leu Pro Asp Thr Leu Asn 

ACA GTC AGA ATA AGA GTC AAA ACA AAT AAG TTA TGC TAT GAG GAT GAC 
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Thr val Arg He Arg Val Lys Thr Asn Lys Leu Cys Tyr Glu Asp Asp 

310 315 320 

AAA CTC TGG AGT AAT TGG AGO CAA GAA ATG AGT ATA GGT AAG AAG CGC 
™ Leu lr P Ser Tsn Trp Ser Gin Glu Met Ser He Gly Lys Lys Arg 

325 330 J- 5 - 3 

AAT TCC ACA CTC TAC ATA ACC ATG TTA CTC ATT GTT CCA GTC ATC GTC 
£n ser ?£ Leu Tyr He Thr Met Leu Leu He Val Pro Val He Val 
345 350 

CCA CCT GCA ATC ATA GTA CTC CTG CTT TAC CTA AAA AGG CTC AAG ATT 
Ala Gly aS He He" Val Leu Leu Leu Tyr Leu Lys Arg Leu Lys He 
360 365 

ATT ATA TTC CCT CCA ATT CCT GAT CCT GGC AAG ATT TTT AAA GAA ATG 
111 He" Phe Pro Pro He Pro Asp Pro Gly Lys He Phe Lys Glu Met 
375 380 385 

TTT GGA GAC CAG AAT GAT GAT ACT CTG CAC TGG AAG AAG TAC GAC ATC 
III Gly Asp Gin Asn Asp Asp Thr Leu His Trp Lys Lys Tyr Asp He 

390 395 400 

TAT GAG AAG CAA ACC AAG GAG GAA ACC GAC TCT GTA GTG CTG ATA GAA 
■Eyr Glu Lys Gin Thr Lys Glu Glu Thr Asp Ser Val Val Leu He Glu 
405 410 4i 

AAC CTG AAG AAA GCC TCT CAG TGATGGAGAT AATTTATTTT TACCTTCACT 
Asn Leu Lys Lys Ala Ser Gin 
425 

GTGACCTTGA GAAGATTCTT CCCATTCTCC ATTTGTTATC TGGGAACTTA TTAAATGGAA 
ACTGAAACTA CTG CAC CAT T T AAAAAC AG G CAGCT CAT AA GAGCCACAGG TCTTTATGTT 
GAGTCGCGCA CCGAAAAACT AAAAATAATG GGCGCTTTGG AGAAGAGT GT GGAGTCATTC 
T CAT T G AAT T ATAAAAGCCA GCAGGCTTCA AACTAGGGGA CAAAGCAAAA AGT GAT GAT A 
GTGGTGGAGT TAATCTTATC AAGAGTTGTG ACAACTTCCT GAGGGATCTA TACTTGCTTT 
GTGTTCTTTG T GT C AAC AT G AACAAATTTT ATTTGTAGGG G AAC T CAT T T GGGGTGCAAA 
TGCTAATGTC AAACT T GAGT CACAAAGAAC AT GT AGAAAA C AAAAT G GAT AAAAT C T GAT 
ATGTATTGTT TGGGATCCTA T T G AAC CAT G TTTGTGGCTA TTAAAACTCT TTTAACAGTC 
TGGGCTGGGT CCGGTGGCTC ACGCCTGTAA TCCCAGCAAT TTGGGAGTCC GAGGCGGGCG 
GATCACTCGA GGT CAGGAGT T C CAG AC CAG CCTGACCAAA AT GGT GAAAC CTCCTCTCTA 
CTAAAACTAC AAAAATTAAC TGGGTGTGGT GGCGCGTGCC TGTAATCCCA GCTACTCGGG 
AAGCTGAGGC AGGT GAATTG TTTGAACCTG GGAGGT GGAG GTTGCAGTGA GCAGAGATCA 
CACCACTGCA CTCTAGCCTG GGT GAC AGAG CAAGACT CTG TCTAAAAAAC AAAACAAAAC 
AAAACAAAAC AAAAAAACCT CTTAATATTC TGGAGTCATC ATTCCCTTCG ACAGCATTTT 
CCTCTGCTTT GAAAGCCCCA GAAATCAGTG TTGGCCATGA T GACAACT AC AGAAAAACCA 
GAGGCAGCTT CTTTGCCAAG ACCTTTCAAA GCCATTTTAG GCTGTTAGGG GCAGTGGAGG 
TAGAATGACT CCTTGGGTAT TAGAGTTTCA AC CAT GAAGT CTCTAACAAT GTATTTTCTT 
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CACCTCTGCT ACT GAAGT AG CAT TTACT GT GTCTTTGGTT TGTGCTAGGC CCCCGGGTGT 2433 

GAAGCACAGA CCCCTTCCAG GGGTTTACAG T CT AT TT GAG ACTCCTCAGT TCTTGCCACT 2493 

TTTTTTTTTA ATCTCCACCA GTCATTTTTC AGACCTTTTA ACTCCTCAAT TCCAACACTG 2553 

ATTTCCCCTT TTGCATTCTC CCTCCTTCCC TTCCTTGTAG CCTTTTGACT TTCATTGGAA 2 613 

ATTAGGAT GT AAATCTGCTC AGGAGACCT G GAGGAGCAGA GGATAATTAG CAT CT CAGGT 2 673 

TAAGTGTGAG TAAT CT GAGA AACAATGACT AATTCTTGCA TAT T T T GT AA CTTCCATGTG 2733 

AGGGTTTTCA GCATT GAT AT TTGTGCATTT T CTAAACAGA GAT GAGGT GG TATCTTCACG 2793 

T AG AAC AT T G GTATTCGCTT GAGAAAAAAA GAATAGTTGA ACCTATTTCT CTTTCTTTAC 2853 

AAGATGGGTC CAGGATTCCT CTTTTCTCTG CCATAAATGA TTAATTAAAT AGCTTTTGTG 2913 

T CT T AC AT T G GTAGCCAGCC AGCCAAGGCT CTGTTTATGC TTTTGGGGGG CAT AT AT T GG 2973 

GTTCCATTCT CACCTATCCA CACAACATAT CCGTATATAT CCCCTCTACT CTTACTTCCC 3033 

CCAAATTTAA AGAAGT AT G G GAAAT GAGAG GCATTTCCCC CACCCCATTT CTCTCCTCAC 3093 

AC AC AG AC T C ATATTACT GG TAGGAACTTG AGAACTTTAT TTCCAAGTTG TTCAAACATT 3153 

TACCAATCAT ATTAATACAA TGATGCTATT TGCAATTCCT GCTCCTAGGG GAGGGGAGAT 3213 

AAGAAACCCT CACTCTCTAC AGGTTTGGGT ACAAGT GGCA ACCTGCTTCC ATGGCCGTGT 3273 

AGAAGCAT GG TGCCCTGGCT TCTCTGAGGA AGCTGGGGTT CAT GAC AAT G GCAGATGTAA 3333 

AGTTATTCTT GAAGT CAGAT TGAGGCTGGG AGACAGCCGT AGTAGAT GTT CTACTTTGTT 3393 

CTGCTGTTCT CTAGAAAGAA TATTTGGTTT TCCTGTATAG GAATGAGATT AATTCCTTTC 3453 

CAGGTATTTT ATAATTCTGG GAAGCAAAAC CCATGCCTCC CCCTAGCCAT TTTTACTGTT 3513 

ATCCTATTTA GATGGCCATG AAGAGGAT GC TGTGAAATTC C C AACAAACA TTGATGCTGA 357 3 

CAGTCATGCA GTCTGGGAGT G G G GAAGT GA TCTTTTGTTC CCATCCTCTT CT T T TAG CAG 3633 

TAAAATAGCT GAGGGAAAAG GGAGGGAAAA G GAAGT TAT G GGAATACCTG TGGTGGTTGT 3693 

GATCCCTAGG TCTTGGGAGC TCTTGGAGGT GT CT GT AT CA GTGGATTTCC CATCCCCTGT 3753 

GGGAAATTAG TAGGCTCATT TACTGTTTTA GGTCTAGCCT ATGTGGATTT TTTCCTAACA 3813 

TACCTAAGCA AACCCAGTGT CAGGAT GGTA ATTCTTATTC TTTCGTTCAG TTAAGTTTTT 3873 

CCCTTCATCT GGGCACTGAA GGGATATGTG AAACAATGTT AACATTTTTG GTAGTCTTCA 3933 

ACCAGGGATT GTTTCTGTTT AACTTCTTAT AGGAAAGCTT GAGTAAAATA AAT AT T GT CT 3993 
TTTTGTATGT CAAGCGGGCC GCCACCGCGG TGGAAACTCC AGCTT 
(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 427 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



4038 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Met Glu Trp Pro Ala Arg Leu Cys Gly Leu Trp Ala Leu Leu Leu Cys 
15 10 15 

Ala Gly Gly Gly Gly Gly Gly Gly Gly Ala Ala Pro Thr Glu Thr Gin 
20 25 30 

Pro Pro Val Thr Asn Leu Ser Val Ser Val Glu Asn Leu Cys Thr Val 
35 40 45 

He Trp Thr Trp Asn Pro Pro Glu Gly Ala Ser Ser Asn Cys Ser Leu 
50 * 55 60 

Trp Tyr Phe Ser His Phe Gly Asp Lys Gin Asp Lys Lys He Ala Pro 
65 70 75 80 

Glu Thr Arg Arg Ser He Glu Val Pro Leu Asn Glu Arg He Cys Leu 
85 90 95 

Gin Val Gly Ser Gin Cys Ser Thr Asn Glu Ser Glu Lys Pro Ser He 
100 105 HO 

Leu Val Glu Lys Cys He Ser Pro Pro Glu Gly Asp Pro Glu Ser Ala 
115 120 125 

Val Thr Glu Leu Gin Cys He Trp His Asn Leu Ser Tyr Met Lys Cys 
130 135 140 

Ser Trp Leu Pro Gly Arg Asn Thr Ser Pro Asp Thr Asn Tyr Thr Leu 
145 150 155 160 

Tvr Tyr Trp His Arg Ser Leu Glu Lys He His Gin Cys Glu Asn He 
165 170 175 

Phe Arg Glu Gly Gin Tyr Phe Gly Cys Ser Phe Asp Leu Thr Lys Val 
180 185 190 

Lys Asp Ser Ser Phe . Glu Gin His Ser Val Gin He Met Val Lys Asp 
195 200 205 

Asn Ala Gly Lys He Lys Pro Ser Phe Asn He Val Pro Leu Thr Ser 
210 215 220 

Arq Val Lys Pro Asp Pro Pro His He Lys Asn Leu Ser Phe His Asn 
225 230 235 240 

Asp Asp Leu Tyr Val Gin Trp Glu Asn Pro Gin Asn Phe He Ser Arg 
245 250 255 

Cys Leu Phe Tyr Glu Val Glu Val Asn Asn Ser Gin Thr Glu Thr His 
260 265 270 

Asn Val Phe Tyr Val Gin Glu Ala Lys Cys Glu Asn Pro Glu Phe Glu 
275 280 285 

Arq Asn Val Glu Asn Thr Ser Cys Phe Met Val Pro Gly Val Leu Pro 
290 295 300 

Asp Thr Leu Asn Thr Val Arg He Arg Val Lys Thr Asn Lys Leu Cys 



Tyr Glu Asp Asp Lys Leu Trp Ser Asn Trp Ser Gin Glu Met Ser He 
325 330 335 

Gly Lys Lys Arg Asn Ser Thr Leu Tyr He Thr Met Leu Leu He Val 
340 345 350 

Pro Val He Val Ala Gly Ala He He Val Leu Leu Leu Tyr Leu Lys 
355 360 365 

Arg Leu Lys He He He Phe Pro Pro He Pro Asp Pro Gly Lys He 
370 375 380 

Phe Lys Glu Met Phe Gly Asp Gin Asn Asp Asp Thr Leu His Trp Lys 
385 390 395 400 

Lvs Tyr Asp He Tyr Glu Lys Gin Thr Lys Glu Glu Thr Asp Ser Val 
405 " 410 415 

Val Leu He Glu Asn Leu Lys Lys Ala Ser Gin 
420 425 



